Goto

Collaborating Authors

 Groton


Empowering Clinical Trial Design through AI: A Randomized Evaluation of PowerGPT

Lu, Yiwen, Li, Lu, Zhang, Dazheng, Jian, Xinyao, Wang, Tingyin, Chen, Siqi, Lei, Yuqing, Tong, Jiayi, Xi, Zhaohan, Chu, Haitao, Luo, Chongliang, Ogdie, Alexis, Athey, Brian, Turan, Alparslan, Abramoff, Michael, Cappelleri, Joseph C, Xu, Hua, Lu, Yun, Berlin, Jesse, Sessler, Daniel I., Asch, David A., Jiang, Xiaoqian, Chen, Yong

arXiv.org Artificial Intelligence

Sample size calculations for power analysis are critical for clinical research and trial design, yet their complexity and reliance on statistical expertise create barriers for many researchers. We introduce PowerGPT, an AI-powered system integrating large language models (LLMs) with statistical engines to automate test selection and sample size estimation in trial design. In a randomized trial to evaluate its effectiveness, PowerGPT significantly improved task completion rates (99.3% vs. 88.9% for test selection, 99.3% vs. 77.8% for sample size calculation) and accuracy (94.1% vs. 55.4% in sample size estimation, p < 0.001), while reducing average completion time (4.0 vs. 9.3 minutes, p < 0.001). These gains were consistent across various statistical tests and benefited both statisticians and non-statisticians as well as bridging expertise gaps. Already under deployment across multiple institutions, PowerGPT represents a scalable AI-driven approach that enhances accessibility, efficiency, and accuracy in statistical power analysis for clinical research.


PropaInsight: Toward Deeper Understanding of Propaganda in Terms of Techniques, Appeals, and Intent

Liu, Jiateng, Ai, Lin, Liu, Zizhou, Karisani, Payam, Hui, Zheng, Fung, May, Nakov, Preslav, Hirschberg, Julia, Ji, Heng

arXiv.org Artificial Intelligence

Propaganda plays a critical role in shaping public opinion and fueling disinformation. While existing research primarily focuses on identifying propaganda techniques, it lacks the ability to capture the broader motives and the impacts of such content. To address these challenges, we introduce propainsight, a conceptual framework grounded in foundational social science research, which systematically dissects propaganda into techniques, arousal appeals, and underlying intent. propainsight offers a more granular understanding of how propaganda operates across different contexts. Additionally, we present propagaze, a novel dataset that combines human-annotated data with high-quality synthetic data generated through a meticulously designed pipeline. Our experiments show that off-the-shelf LLMs struggle with propaganda analysis, but training with propagaze significantly improves performance. Fine-tuned Llama-7B-Chat achieves 203.4% higher text span IoU in technique identification and 66.2% higher BertScore in appeal analysis compared to 1-shot GPT-4-Turbo. Moreover, propagaze complements limited human-annotated data in data-sparse and cross-domain scenarios, showing its potential for comprehensive and generalizable propaganda analysis.


CardioGenAI: A Machine Learning-Based Framework for Re-Engineering Drugs for Reduced hERG Liability

Kyro, Gregory W., Martin, Matthew T., Watt, Eric D., Batista, Victor S.

arXiv.org Artificial Intelligence

The link between in vitro hERG ion channel inhibition and subsequent in vivo QT interval prolongation, a critical risk factor for the development of arrythmias such as Torsade de Pointes, is so well established that in vitro hERG activity alone is often sufficient to end the development of an otherwise promising drug candidate. It is therefore of tremendous interest to develop advanced methods for identifying hERG-active compounds in the early stages of drug development, as well as for proposing redesigned compounds with reduced hERG liability and preserved on-target potency. In this work, we present CardioGenAI, a machine learning-based framework for re-engineering both developmental and commercially available drugs for reduced hERG activity while preserving their pharmacological activity. The framework incorporates novel state-of-the-art discriminative models for predicting hERG channel activity, as well as activity against the voltage-gated NaV1.5 and CaV1.2 channels due to their potential implications in modulating the arrhythmogenic potential induced by hERG channel blockade. We applied the complete framework to pimozide, an FDA-approved antipsychotic agent that demonstrates high affinity to the hERG channel, and generated 100 refined candidates. Remarkably, among the candidates is fluspirilene, a compound which is of the same class of drugs (diphenylmethanes) as pimozide and therefore has similar pharmacological activity, yet exhibits over 700-fold weaker binding to hERG. We envision that this method can effectively be applied to developmental compounds exhibiting hERG liabilities to provide a means of rescuing drug development programs that have stalled due to hERG-related safety concerns. Additionally, the discriminative models can also serve independently as effective components of a virtual screening pipeline. We have made all of our software open-source.


Waymo's Fatigue Risk Management Framework: Prevention, Monitoring, and Mitigation of Fatigue-Induced Risks while Testing Automated Driving Systems

Favaro, Francesca, Hutchings, Keith, Nemec, Philip, Cavalcante, Leticia, Victor, Trent

arXiv.org Artificial Intelligence

This report presents Waymo's proposal for a systematic fatigue risk management framework that addresses prevention, monitoring, and mitigation of fatigue-induced risks during on-road testing of ADS technology. The proposed framework remains flexible to incorporate continuous improvements, and was informed by state of the art practices, research, learnings, and experience (both internal and external to Waymo). Fatigue is a recognized contributory factor in a substantial fraction of on-road crashes involving human drivers, and mitigation of fatigue-induced risks is still an open concern researched world-wide. While the proposed framework was specifically designed in relation to on-road testing of SAE Level 4 ADS technology, it has implications and applicability to lower levels of automation as well.


Navy Block V submarine deal brings new attack ops and strategies

FOX News

The Virginia-class, nuclear-powered, fast-attack submarine, USS North Dakota (SSN 784), transits the Thames River as it pulls into its homeport on Naval Submarine Base New London in Groton, Conn - file photo. Bringing massive amounts of firepower closer to enemy targets, conducting clandestine "intel" missions in high threat waters and launching undersea attack and surveillance drones are all anticipated missions for the Navy's emerging Block V Virginia-class attack submarines. The boats, nine of which are now surging ahead through a new developmental deal between the Navy and General Dynamics Electric Boat, are reshaping submarine attack strategies and concepts of operations -- as rivals make gains challenging U.S. undersea dominance. Eight of the new 22-billion Block V deal are being engineered with a new 80-foot weapons sections in the boat, enabling the submarine to increase its attack missile capacity from 12 to 40 on-board Tomahawks. "Block V Virginias and Virginia Payload Module are a generational leap in submarine capability for the Navy," Program Executive Officer for Submarines Rear Adm. David Goggins, said in a Navy report.


Integrating Machine Learning With Microsimulation to Classify Hypothet POR

#artificialintelligence

Purpose: Variability in patient treatment responses can be a barrier to effective care. Utilization of available patient databases may improve the prediction of treatment responses. We evaluated machine learning methods to predict novel, individual patient responses to pregabalin for painful diabetic peripheral neuropathy, utilizing an agent-based modeling and simulation platform that integrates real-world observational study (OS) data and randomized clinical trial (RCT) data. Patients and methods: The best supervised machine learning methods were selected (through literature review) and combined in a novel way for aligning patients with relevant subgroups that best enable prediction of pregabalin responses. Data were derived from a German OS of pregabalin (N 2642) and nine international RCTs (N 1320).


Futility Analysis in the Cross-Validation of Machine Learning Models

Kuhn, Max

arXiv.org Machine Learning

Many machine learning models have important structural tuning parameters that cannot be directly estimated from the data. The common tactic for setting these parameters is to use resampling methods, such as cross--validation or the bootstrap, to evaluate a candidate set of values and choose the best based on some pre--defined criterion. Unfortunately, this process can be time consuming. However, the model tuning process can be streamlined by adaptively resampling candidate values so that settings that are clearly sub-optimal can be discarded. The notion of futility analysis is introduced in this context. An example is shown that illustrates how adaptive resampling can be used to reduce training time. Simulation studies are used to understand how the potential speed--up is affected by parallel processing techniques.


Applied AI News

Blanchard, David

AI Magazine

Prairie Virtual Systems Corp. play the resulting changes. Control Ocean Surveillance Center (Chicago, Ill.) has developed a barrier-free Harvest Software Inc. (Sunnyvale, (San Diego, Cal.) is conducting sea trials design virtual reality system Cal.) has integrated a neural networkbased of a multi-beam acoustic signal detection that checks building access. The which helps designers meet the into its Harvest Operator software Navy is evaluating the neural network-based requirements of the American with system. Harvest Operator automates system's ability to operate Disabilities Act, assists in the design the entire process of receiving faxed in real time, and to detect and classify of interiors that are accessible and forms. The neural network adds intelligent acoustic signals.


Laps: Cases to Models to Complete Expert Systems

Piazza, Joseph S. di, Helsabeck, Frederick A.

AI Magazine

Contrary to many prevailing approaches to knowledge acquisition, Laps, our expert-interviewing software, begins by soliciting cases from the expert, but it does not end there. Its uniqueness lies in the fact that it interweaves knowledge gathering, organizing, and testing. Laps begins with a case in the form of a sample solution path elicited from the domain expert. This sample solution path is refined by a process called dechunking, which facilitates finding a model of the expert's reasoning process. The model guides the determination of the structure of alternatives tables at an effective level of abstraction. Once these tables have been set up, the expert is able to produce row after row on his own until a complete rule base is built. A rule generator currently produces rules in Clips or M.1 syntax.